The Parameterized Complexity of p-Center Approximate Substring Problems
نویسندگان
چکیده
Problems associated with nding strings that are within a speci ed Hamming distance of a given set of strings occur in several disciplines. All of the problems investigated are NP -hard and have varying levels of approximability. In this paper, we use techniques from parameterized computational complexity to assess non-polynomial time algorithmic options for three of these problems, namely p-exact substring (pes), approximate substring (1as), and p-approximate substring (pas). These problems vary whether the substring must be an exact match, and also whether a single substring or a set of substrings (of cardinality p) is required. Our analyses indicate under which parameter restrictions useful algorithms are possible, and include both class membership and parameterized reductions to prove class hardness. Since variation in parameter restrictions will lead to di erent algorithms being preferable, we give a variety of algorithms for the xed parameter tractable problem variations. One of these, for 1as with alphabet, substring length, and distance all xed, is an improvement of one of the best previously known exact algorithms (under these restrictions). Other algorithms solve parameterized variants previously unexplored. We also prove that pes is NP-hard, and show inapproximability for pes and pas. Faculty of Computer Science, University of New Brunswick, Fredericton, NB, Canada. E-mail: {pevans,p7ka}@unb.ca Department of Computer Science, Memorial University of Newfoundland, St. John's, NF, Canada. E-mail: [email protected]
منابع مشابه
On the complexity of finding common approximate substrings
Problems associated with #nding strings that are within a speci#ed Hamming distance of a given set of strings occur in several disciplines. In this paper, we use techniques from parameterized complexity to assess non-polynomial time algorithmic options and complexity for the COMMON APPROXIMATE SUBSTRING (CAS) problem. Our analyses indicate under which parameter restrictions useful algorithms ar...
متن کاملMore Efficient Algorithms for Closest String and Substring Problems
The closest string and substring problems find applications in PCR primer design, genetic probe design, motif finding, and antisense drug design. For their importance, the two problems have been extensively studied recently in computational biology. Unfortunately both problems are NP-complete. Researchers have developed both fixed-parameter algorithms and approximation algorithms for the two pr...
متن کاملParameterized Matching
Two equal length strings s and s, over alphabets Σs and Σs′ , parameterize match if there exists a bijection π : Σs → Σs′ , such that π(s) = s, where π(s) is the renaming of each character of s via π. Parameterized matching is the problem of finding all parameterized matches of a pattern string p in a text t. It was introduced as a model for software duplication detection in software maintenanc...
متن کاملOn the parameterized complexity of approximate counting
In this paper we study the parameterized complexity of approximating the parameterized counting problems contained in the class #W [P ] ; the parameterized analogue of #P: We prove a parameterized analogue of a famous theorem of Stockmeyer claiming that approximate counting belongs to the second level of the polynomial hierarchy.
متن کاملHard problems in similarity searching
The Closest Substring Problem is one of the most important problems in the field of computational biology. It is stated as follows: given a set of t sequences s1; s2; : : : st over an alphabet , and two integers k; d with d k, can one find a string s of length k and, for all i = 1; 2; : : : ; t, substrings oi of si, all of length k, such that d(s; oi) d (for all i = 1; 2; : : : ; t)? (here, d(:...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001